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In the presence of large uncertainties, a control system needs to be able to adapt rapidly to regain perfor- 
mance. Fast adaptation is referred to as the implementation of adaptive control with a large adaptive gain to 
reduce the tracking error rapidly; however, a large adaptive gain can lead to high-frequency oscillations which 
can adversely affect the robustness of an adaptive control law. A new adaptive control modification is presented 
that can achieve robust adaptation with a large adaptive gain without incurring high-frequency oscillations as 
with the standard model-reference adaptive control. The modification is based on the minimization of the J&2 
norm of the tracking error, which is formulated as an optimal control problem. The optimality condition is 
used to derive the modification using the gradient method. The optimal control modification results in a stable 
adaptation and allows a large adaptive gain to be used for better tracking while providing sufficient robustness. 
A damping term (v) is added in the modification to increase damping as needed. Simulations were conducted 
on a damaged F-18 aircraft (McDonnell Douglas, now The Boeing Company, Chicago, Illinois) with both the 
standard baseline dynamic inversion controller and the adaptive optimal control modification technique. The 
results demonstrate the effectiveness of the proposed modification in tracking a reference model. 


Nomenclature 

C ma coefficient of pitching moment due to angle of attack, 1/deg 

a angle of attack, deg 

p angle of sideslip, deg 

0; weight update law 

e tracking error rate 

c o frequency rad/s 

<I>(jc,z) basis function 

0/ weight law 

£ damping ratio - 

A state derivative matrix 

An subset derivative matrix for /?, q , and r states 

A 12 subset derivative matrix for </>, a, j3, V, h and 6 states 

B control derivative matrix 

e tracking error 

K constant gain 

Ki integral gain 

K p proportional gain 
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p body axis roll rate, deg/s 

PID proportional-integral-derivative 
q body axis pitch rate, deg/s 

r body axis yaw rate, deg/s 

s Laplace operator 

x aircraft state(s) 


I. Introduction 

The objective of this paper is to show the implementation of a fast adaptive controller without high-frequency 
oscillations. Adaptive control is a potentially promising technology that can improve performance and stability of a 
conventional fixed-gain controller. In recent years, adaptive control has been receiving a significant amount of atten- 
tion. In aerospace applications, adaptive control has been demonstrated in a number of flight vehicles. For example, 
the National Aeronautics and Space Administration (NASA) has recently conducted flight-testing of a neural network 
intelligent flight control system on board a modified F-15 (McDonnell Douglas, now The Boeing Company, Chicago, 
Illinois) test aircraft. The ability to accommodate system uncertainties and to improve fault tolerance of a control 
system is a major selling point of adaptive control since traditional gain-scheduling or fixed-gain control methods are 
viewed as less capable of handling off-nominal operating conditions outside of a normal operating envelope. Nonethe- 
less, these traditional control methods tend to be robust to disturbances and unmodeled dynamics when operated as 
intended. 

Over the past several years, various model-reference adaptive control (MRAC) methods have been investigated. _l 1 
The majority of MRAC methods may be classified as direct, indirect, or a combination thereof. Indirect adaptive 
control methods are based on identification of unknown plant parameters and certainty-equivalence control schemes 
derived from the parameter estimates which are assumed to be their true values. 1 Parameter identification techniques 
such as recursive least-squares and neural networks have been used in indirect adaptive control methods. In con- 
trast, direct adaptive control methods directly adjust control parameters to account for system uncertainties without 
identifying unknown plant parameters explicitly. 

Model-reference adaptive control methods based on neural networks have been a topic of great research interest. " 
A paper by Rysdyk and Calise described a neural network direct adaptive control method for improving tracking 
performance based on a model inversion control architecture. This method is the basis for the intelligent flight control 
system that has been developed for the F-15 test aircraft by NASA. In a paper by Johnson et al. the authors introduced 
a pseudo-control hedging approach for dealing with control input characteristics such as actuator saturation, rate limit, 
and linear input dynamics. Hovakimyan et al. developed an output feedback adaptive control to address problems 
with parametric uncertainties and unmodeled dynamics. Cao and Hovakimyan developed an J&fj adaptive control 
method with a low-pass implementation. 

While adaptive control has been used with success in a number of applications, the possibility of high-gain control 
due to fast adaptation can be a problem. It is well known that high-gain control or fast adaptation can result in high- 
frequency oscillations which can excite unmodeled dynamics that could adversely affect the stability of an MRAC 
law. In certain applications, fast adaptation is needed in order to improve tracking performance when a system is 
subject to a large source of uncertainties such as structural damage to an aircraft that could cause large changes in 
system dynamics. In these situations, large adaptive gains can be used in order to reduce the tracking error rapidly; 
however, typically, there exists a balance between stability and adaptation. 

Various modifications were developed to increase robustness of MRAC by adding damping to the adaptive law. 
Two well-known modifications in adaptive control are the cr- modification and £\ - modification. These modifica- 
tions have been used extensively in adaptive control. This paper uses a new adaptive law based on an optimal control 
formulation to minimize the JS ?2 norm of the tracking error. The optimality condition results in a damping term 
proportional to the persistent excitation. The analysis shows that the adaptive optimal control modification can al- 
low fast adaptation with a large adaptive gain without high-frequency oscillations and can provide improved stability 
robustness while preserving the tracking performance. 

This paper demonstrates a practical implementation of the optimal control modification in an adaptive flight control 
architecture on the F-18 (McDonnell Douglas, now The Boeing Company, Chicago, Illinois) aircraft model. Simula- 
tion results are presented to demonstrate potential benefits of the new adaptive control law. The layout of the remaining 
paper is as follows: Section II, Adaptive Flight Control; Section III, Simulation Results on an F-18 Aircraft Model; 
Section IV, Conclusions; and Section V, Appendix. 
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II. Adaptive Flight Control 


Consider the inner-loop adaptive flight control architecture shown in Fig. 1. The control architecture comprises: 
a reference model that translates rate commands into desired acceleration commands; a proportional-integral (PI) 
feedback control for rate stabilization and tracking; a dynamic inversion controller that computes actuator commands 
using desired acceleration commands; and a neural network direct MRAC with the optimal control modification. 



Figure 1. Adaptive flight control architecture. 


A. Adaptive Optimal Control Background 

Given a nonlinear plant as in Eq. (1) 

X{t)=Ax(t)+B[u(t)+f(x(t))] (1) 

where x(t ) : [0,oo) — > R w is a state vector, u(t) : [0,©o) R p is a control vector, A E R nxn and B E R nxp are known 
such that the pair (A,B) is controllable, and : R n -» R p is a matched uncertainty. 

Assumption 1: The uncertainty f(x(t)) can be linearly parametrized using a set of basis functions in the form as 
shown in Eq. (2) 

/(*(0) - 1 (x(t)) + e(x(t)) = ©* T <t >(x(t)) + e(x(t)) (2) 

1=1 

where 0* E R mxp is an unknown constant weight matrix that represents a parametric uncertainty, (* ( t )) : R n — > R m 

is a vector of known bounded basis functions that are continuous and differentiable in jc, and e (x (f)) : R n — > R p is an 
approximation error. 

The set of basis functions <&(x(t)) is chosen such that the approximation error £ (jc (f)) is small such that ||e (jc(f))|| < 
£o for all* (t) G^CR”. The universal approximation theorem for sigmoidal neural networks by Cybenko can be used 
for selecting a good set of basis functions (* (t )). 1 Alternatively, Micchellfs theorem provides the theoretical basis 
for a neural network design of 0 T (r) <£(*(?)) using radial basis functions to keep the approximation error £(*(?)) 
small. 

The reference model is specified by Eq. (3) 

-*tw (0 = A m x m (t) T* B m r ( t ) (3) 

where x m ( t ) : [0,oo) — ► R' 1 is a reference state vector, r(t) : [0,t») — > R r e JSfoo is a bounded piecewise continuous 
command vector, A m E R nxn is Hurwitz, and B m E R nxr with r < n. Since r(t) is bounded, x m ( t ) can be shown to be 
uniformly bounded such that Eq. (4): 


Ikn (0)11 < £ =» \\x m (Oil < s (£) , Wt > 0 (4) 

The objective is to design a full-state feedback adaptive control to enable *(r) to follow x m (t) with the controller 
shown in Eq. (5) 

u (0 = K x x (0 + K r r (0 - u ad (0 (5) 

where K x E R pxn and K r E R pxr are gain matrices and ii ad (t) E R p is a direct adaptive signal which estimates the 
parametric uncertainty in the plant such that Eq. (6) 

Uad (0 = © T (0<*>(*(0) (6) 
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where 0(f) £ R mxp is an estimate of the parametric uncertainty 0*. 

Assumption 2: There exist K x and K r such that the model matching conditions shown in Eqs. (7) and (8) are 
satisfied 

A m = A + BK X (7) 

( 8 ) 

Let 0 (t ) = 0 (f ) — 0* be an estimation error of the parametric uncertainty and define the tracking error as e (t) = 
x m it) —x(t), then the tracking error equation becomes Eq. (9). 


e(t)= A m e ( t) + B 0 T (t ) <t> {x (t ) ) - e (x (t ) ) 


(9) 


Adaptive Law : The adaptive law shown in Eq. (10) provides an updated law that minimizes ||e(f)||^ 2 


©(f) = -r<J>(jf(r)) (t)P- v<I> T ( x(t))&(t)B T PA 


B 


( 10 ) 


where T = r T > 0 £ R mxm is an adaptive gain matrix, v > 0 £ R is a weighting constant. The second part of Eq. (10) 
contains the damping term (v) and is used as an additional tuning term. The response without any damping may be 
too low in some cases; v can then be used to improve the robustness. Also, P = P T > 0 £ R nxn solves Eq. (11) 


PA m +A T m P = -Q 


(ID 


where Q = Q J > 0 £ R nxn . A proof can be found in the appendix. 

Choosing a Lyapunov candidate function of the form shown in Eq. (12) 

V (f) = e T (t)Pe(t) + trace (© T (r)I~'&(f)) (12) 

allows the tracking errors to be shown to have the following ultimate bounds. 

Theorem: The adaptive law shown in Eq. (10) results in stable and uniformly ultimately bounded tracking error 
e ( t ) for all ( e (0) ,0 (0)) £ Wp with an ultimate bound as shown in Eq. (13) 


P = 


/ wn^ + wr- 1 )^ 

hnirt (P) 


where Eqs. (14) and (15) 

. _ 2\\PB\\eo 

^ Knin ( Q ) 

2||g T P4- | g||0 o 
Kim {fa^qa-'b) 

with £o = sup, ||e(r)|| and 0 q = ||0*||. See the appendix for a full proof and further definitions. 


(13) 


(14) 

(15) 


B. Implementation of Adaptive Optimal Control on the F-18 Aircraft 


Adaptive flight control can be used to provide consistent handling qualities and restore the stability of an aircraft under 
off-nominal flight conditions such as those due to failures or damage. The linearized equations of motion are expressed 
as shown in Eqs. (16) and (17) 

x = A\\x -\-A]2Z-\- B\u f\ (x,z) (16) 

Z=A 2 \X+A22Z + B 2 U + f2(x,z) (17) 


where An, A 12 , A 21 , A 22 , B 1 , and #2 are nominal undamaged plant matrices which are assumed to 


p q 


is a vector of roll, pitch, and yaw rates; z 


A 0 Aa Ap AV Ah A0 


-iT 


of perturbation in the bank angle A0, angle of attack Aa, sideslip angle A/3, airspeed AV, altitude A/?, and 


be known; 
is a vector 
pitch angle 
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A 8 U A 8 e A 8 r j is a vector of additional aileron, elevator, and rudder deflections; and // (jc,z), / = 1,2 

(18) 


A 6, u = 

is an uncertainty due to damage which can be approximated as shown in Eq. (18) 

//(x,z) = 0* T 4>(.r,z) + £ 

where 3>(x,z) is a basis function chosen to be as shown in Eq. (19). 




jc t px T qx T rx T z T u T (x,z) 


-\ T 


(19) 


The coupled basis function presented here is used to hopefully capture nonlinear error functions. If the damage 
response is highly nonlinear, then to model it accurately, a nonlinear function is a better fit than a simple linear 
function. 

The inner-loop rate feedback control is designed to improve aircraft rate response characteristics such as the short- 
period mode and the Dutch roll mode. A second-order reference model is specified to provide desired handling 
qualities with good damping and natural frequency characteristics as shown in Eqs. (20) through (22): 


( 20 ) 

(21) 


( 22 ) 


(j 2 + 2 £ p (OpS + (Op) <j) m — g p Sfat 

(*£ T* ^^,q(OqS -f- (Qq ) 0 m = gq&lon 
( s 2 + 2£ r O ) r S + G)“) p m = gr^rud 

where 0 /n , Q m , and p m are reference bank, pitch, and sideslip angles; co p , co q , and co r are the natural frequencies for 
desired handling qualities in the roll, pitch, and yaw axes; and are the desired damping ratios; 5/^, 5/ ow , and 

S ru j are the lateral stick input, longitudinal stick input, and rudder pedal input; and g p , g q , and g r are input gains. 

Let p m = (j>m, q, n = and r m = —p m be the reference roll, pitch, and yaw rates. Then the reference model can be 
represented as shown in Eq. (23) 

x m — K p x m Ki I %md T T - Gk (23) 

Jo 

,K P = diag (2t^ p (0 p ,2t; q (0 q , 2 £ r (O r ), K-, = diag ((0^,(0^co}),G = diag (g p ,g q , - g r ), and 


where x m = 
r = 


Pm Qm r m 
lT 


&lat &lon &rud 

Consider an optimal control allocation strategy when employing more control surfaces than the number of states 
to be controlled. Assuming the pair (A\\,B i) is controllable and z is stabilizable, an angular rate feedback dynamic 
inversion controller is computed as shown in Eq. (24) 


M C = S}(S,S7) ' -^K p + ^)x+Gr-A n x-Ai2Z-&J^ 


(24) 


where u c is a control surface deflection command vector. Note the pseudo-inverse required when B\ is non-square. 

-iT 


Let e = 


/o {x m —x)dx X m -x 


equation is given by Eq. (25) 
where Eqs. (26) and (27) 


be the tracking error and set the input gains to one, then the tracking error 


e = A m e + B 


(©[<*>-£) 


A m = 


0 / 
-Ki -K p 


B = 


Let Q = 27, then the solution of the Lyapunov equation, Eq. (11), yields Eq. (28). 

^-l 


P = 


f _1 

Kr l 


K7'Kp + Kp'{K, + I) 


Kf' 

Kp'(t+Kr') 


>0 


(25) 

(26) 

(27) 

(28) 
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A m ’ is computed to be Eq. (29). 


(29) 


AI 1 = 


-K~'K P -K\ 


-l 


Evaluating the term B r PA m ] B of the adaptive law shown in Eq. (10) yields Eq. (30). 

B J PA~ l B = -K~ 2 < 0 

Finally, applying the adaptive optimal control modification, the weight update law is then given by Eq. (3 1 ). 

©, =-r<i)^ T f , B+v<f> T 0 1 /s:f 2 ) 

Changing v affects the damping of the closed loop response and will be shown in the results section. 


(30) 


(31) 


III. Simulation Results for F- 18 Aircraft Model 

To test an adaptive system, a failure needs to be simulated; however, in the "real world" the failure will not be 
known or at best can be isolated to a subsystem. Most failures are typically covered by aerodynamic failures or control 
effectors malfunctions. A "real world" failure is a shift in the center of gravity (eg) and is presented below. A control 
surface failure is another "real world" type of failure. The control failure that is shown in this paper is for illustration 
purposes and is not intended to represent a true failure. 

This section presents results from simulations of [A] matrix and [B] matrix failures. An aerodynamic type of 
failure [A] is inserted first with a large eg aft shift; this corresponds to a change in C ma . The second type of simulated 
failure, which represents a surface failure, inserts a jammed left stabilator, which is the [B] matrix failure. Results from 
the simulation runs are presented to illustrate the advantages of adaptation compared to the baseline dynamic inversion 
without adaptation. The flight condition used in this paper is a test point of Mach 0.5 and an altitude of 15,000 ft. All 
of the pilot inputs to the simulation time histories are from “canned” piloted stick inputs and no attempt to correct for 
the aircraft attitudes are added to the piloted inputs. This “canned pilot input” method was used only for comparison 
puiposes. For instance, when a failure is imparted on the aircraft and the resulting attitudes change minimally, the 
control system is said to have good restoring properties. All the test cases have a one frame delay (1/100 second) at 
the actuators for added realistic implementation purposes. This delay may not be adequate for some designs. 

A. [A] Matrix Failure (Aerodynamic Type Failure) 

The first case is an [A] matrix failure imposed on the aircraft with a destabilizing eg shift or a C ma change. Figure 2 
shows a 40-second time history in which 3 longitudinal pilot stick inputs are presented and the failure is imposed at 13 
seconds. In the first 13 seconds a normal health response shows how the pitch rate follows the commanded pitch rate 
(green) and the stick command (black). After the failure is inserted, the response without adaptation shows that the 
aircraft is stable but with low damping and 2 overshoots (blue). With adaptation on (red), the response is much better 
and follows the commanded pitch rate. By the third pilot input, the adaptation response is close to the commanded 
pitch rate. Notice the low tracking error between q ref and q before the failure and the better tracking response with 
adaptation after the failure. Figure 2 also shows the angle of attack and normal acceleration responses associated with 
the C ma change; and that the system behaves better with adaptation than without. 

Figure 3 shows the tracking error in the roll, pitch and yaw axes along with the adaptation weights. The errors are 
better with adaptation and the weights are convergent. The results show that the adaptation helps with respect to the 
tracking task ( q command) and increases the damping. 

Figure 4 shows the surface positions with and without adaptation. The actuator models are high-fidelity fourth- 
order models with time delays. As expected, the surfaces are well-damped with adaptation on. After observing the 
weights and how they converge, the control surfaces, and the tracking errors, the final analysis for the [A] matrix 
failure example shows that adaptation helps compared to the no adaptation case. 
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Time, s 


Figure 2. Time history of longitudinal states due to an [A] matrix failure C,„ a shift at 13 seconds with and without adaptation. 
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Figure 3. Time history of tracking errors due to an [A] matrix failure C nia shift at 13 seconds with and without adaptation. 
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Time, s 


Figure 4. Time history of control surfaces due to an [A] matrix failure C ma shift at 13 seconds with and w ithout adaptation. 


B. [B] Matrix Failure (Control Surface Jammed) 

The second case is a [B] matrix failure imposed on the left stabilator 13 seconds into the simulation run. The left 
stabilator is jammed (or locked) at +2.5 degrees from trim. Figure 5 shows a 40-second time history of the longitudinal 
responses. During the first 13 seconds the pitch rate follows the commanded pitch rate, but after the failure insertion 
there is a large downward motion and the system cannot track very well. 

Aircraft response comparison with this [B] matrix failure shown in Fig. 5 indicates a better response with adap- 
tation on. Pitch rate follows the reference better with adaptation on. The lateral-directional responses from the same 
longitudinal command also show better aircraft response with adaptation, as shown in Fig. 6. The roll rate with 
adaptation is smaller than without adaptation. The bank angle and sideslip angle both come back to wings-level 
with adaptation but stay at 10 degrees and 8 degrees, respectively, without adaptation. Note that there are no lateral- 
directional pilot inputs ( Prefer ref are zero). Figure 7 shows smaller tracking errors and converging neural networks 
weights. Analysis indicates the system is stable and the performance is better with adaption. 
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Time, s 


Figure 5. Time history of longitudinal states due to a [B] matrix failure (stabilator jammed at 2.5 degrees at 13 seconds with and without 
adaptation. 



Time, s 


Figure 6. Time history of lateral/directional states due to a [B] matrix failure (stabilator jammed at 2.5 degrees at 13 seconds with and 
without adaptation. 
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Figure 7. Time history of tracking errors due to a [B] matrix failure (stabilator jammed at 2.5 degrees at 13 seconds with and without 
adaptation. 


C. [A] Matrix Failure for Variable Learning Rate (O and Damping (v) Test Cases 

This section shows the adaptation rate gamma (T) can be increased and the aircraft will remain stable. This optimal 
control modification method enables fast adaptation with good damping. The test case changes the adaptation rate 
from 0.5 to 50 while keeping the damping term (v) constant at 1 . Figure 8 shows the same [A] matrix failure as that 
presented in subsection A, but the failure shown in Fig. 8 occurs at 2 seconds instead of at 13 seconds and is followed 
by a pitch input. As Fig. 8 shows, the pitch rate tracking error is large with a T of 0.5 compared to 50. The weights 
are also shown, and the larger adaptation rate increases the size of the weights as expected. Note that in both cases the 
system weights are convergent and the tracking error is better with the larger adaptation rate. 

Figure 9 shows what happens when the damping term is changed (v from 0.25 to 1) while keeping the adaptation 
rate constant (T = 5). The tracking error has low damping with the lower v term of 0.25 as expected. In both cases the 
weights converge to reasonable values. The results and analysis show that larger adaptation rates can be tolerated with 
this optimal control modification technique. The damping term can be used to tune the desired performance. 
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Figure 8. Time history of pitch rate error and weights due to an [A] matrix failure C ma shift at 2 seconds; learning rate r of 0.5 and 50 w ith 
fixed damping term v = 1. 




Figure 9. Time history of pitch rate error and weights due to an [A] matrix failure Crn a shift at 2 seconds; fixed learning rate r = 5 and 
variable damping term v of 0.25 and 1. 


IV. Conclusions 

This study presents a new modification to the standard model-reference adaptive control based on an optimal con- 
trol formulation of minimizing the J£?2 norm of the tracking error. The optimal control modification adds a damping 
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term to the adaptive law that is proportional to the persistent excitation. The modification enables fast adaptation with- 
out sacrificing robustness. The modification can be tuned using a parameter v to provide a trade-off between tracking 
performance and damping. Increasing v improves damping to unmodeled dynamics. Simulations of a damaged F-18 
aircraft showed improvements in performance when compared to the baseline controller. 


V. Appendix 


Given a nonlinear plant as in Eq. (32) 


x(t)=Ax(t)+B[u(t)+f(x(t))] (32) 

where A(f) : [0,oo) -> R n is a state vector, u(t) : [0,oo) — > RP is a control vector, A G R nxn and B G R nxp are known 
such that the pair (A, B) is controllable, and f(x(t)) : R' 1 -» R p is a matched uncertainty. 

Assumption: The uncertainty f{x(t)) can be linearly parametrized using a set of basis functions in the form as 
Eq. (33) 

/(*(')) = E (Jt(0) + e (Jf(0) = &* J &(x(t)) + e(x(t)) (33) 

1=1 

where 0* G R ,nxp is an unknown constant weight matrix that represents a parametric uncertainty, (a ( t )) : R n — > R m 

is a vector of known bounded basis functions that are continuous and differentiable in x , and e (a (t)) : R' 1 — ► R p is an 
approximation error. 

The set of basis functions <P(x(t)) is chosen such that the approximation error e (a ( t )) is small such that ||e (A'(f))|| < 
€o for all x (t) G®cR". The universal approximation theorem for sigmoidal neural networks by Cybenko can be used 
for selecting a good set of basis functions <t> (a (t )). 1 Alternatively, Micchelli’s theorem provides the theoretical basis 
for a neural network design of 0 T (t)<t>(x(t)) using radial basis functions to keep the approximation error e(A(r)) 
small. 

The reference model is specified by Eq. (34) 

A m (0 = A. m x m {t ) B m r ( t ) (34) 

where x m (t) : [0,oo) -> R n is a reference state vector, r(t) : [0,oo) -* R r e J^oo is a bounded piecewise continuous 
command vector, A m G R nxn is Elurwitz, and B m G R wxr with r < n. Since r(t) is bounded, x m ( t ) can be shown to be 
uniformly bounded such that Eq. (35). 


||A m (0) II < £ =* Ha w ( t ) II < 8 (e ) , Vr > 0 (35) 

The objective is to design a full-state feedback adaptive control to enable a (t) to follow x m (t) with the controller 
shown in Eq. (36) 

u{t) = K x x (t) + K r r (t) - Had (0 (36) 

where K x G R pxn and K r G R pxr are gain matrices and Uad (t) G R p is a direct adaptive signal which estimates the 
parametric uncertainty in the plant such that Eq. (37) 

IW(0=© T (0 <*>(*(')) (37) 


where Q(t) G R mxp is an estimate of the parametric uncertainty 0*. 

Assumption 2: There exist K x and K r such that the model matching conditions shown in Eqs. (38) and (39) are 
satisfied. 

A m =A-\-BK x (38) 

B m = BK r (39) 

Let 0 (t) = 0 (t) — 0* be an estimation error of the parametric uncertainty and define the tracking error as e (f) = 
x m (t) — A(r), then the tracking error equation becomes Eq. (40). 


e (t ) = A m e ( t) + B ® T (t)<P(x(t))-e(x (t)) 


(40) 


Proposition : The adaptive law shown in Eq. (41 ) provides an update law that minimizes ||e(0llj^ 2 


© ( t ) = -r4 > (jc (f)) [<- T (i t ) P - v<J) T (x (t)) 0 (t) B J PA~ l 


B 


(41) 
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where F = F J > 0 e W nxm is an adaptive gain matrix, v > 0 G M is a weighting constant, and P = P T > 0 G R nxn 
solves Eq. (42) 

PA m +A J m P = -Q (42) 

where Q=Q J > 0 e M" x ". 

Proof: The adaptive law seeks to minimize an infinite-time horizon cost function as Eq. (43) 

J= lim \ f f [e(t)-A(t)] J Q[e(t)-A(t)]dt (43) 

2 J 0 

subject to Eq. (40) where A represents the unknown lower bound of the tracking error shown in Figure 10. 



Figure 10. Tracking error lower bound. 


J is convex and represents the distance measured from the normal surface of a ball B r c @ C with a radius A. 
The cost function is designed to provide robustness by not seeking the tracking error that tends to zero but rather to 
some lower bound away from the origin. By not requiring e (t) — > 0 as t — >• <», the adaptation can be made to be robust. 
Therefore, the tracking performance can be traded with robustness by a suitable selection of the tuning parameter v. 

This optimal control problem can be formulated by the Pontryagin’s Minimum Principle. Defining a Hamiltonian, 
Eq. (44), 


//(e (f),®(0) = l -[e(t)-A(t)} J Q[e(t)-A(t)} 


where p(t) : [0,©o) — > R n is an adjoint variable, then the necessary condition gives Eq. (45) 

p{t) - -VHj = - Q[e(t)-A(t )] -A J m p(t) (45) 

with the transversal ity condition p ( tf — > oo) = 0 since e (0) is known. 

The adaptive law can then be formulated by a gradient update law as Eq. (46). 


&(t) = -rv%r = -r<j>(*(0) v/v* = -r <*>(*(/)) p T ( t)B (46) 

The solution of p(t) can be obtained using a “sweeping” method by letting p(t) — P(t)e{t) + S (r) © T (t) <t> (x (;)). 
Then, Eq. (47) 


P{t)e(t) + P(t) (A m e {t ) + £© t (t ) 4> (x (t)) - fi©* T 4> (jc (»)) - Be (x (t ) )) + $ (t ) © T (t) <t> (x (t ) ) 

d (q t (t) <t> (x (t))) T r t 

+ 5(0 V A =-Q[e(t)-A (r)] -Al [P (t)e(t)+S(t ) © T (r) $ (* (t )) 


(47) 


which yields Eqs. (48) through (50) 


P(t) + P(t)A m +AZP(t) + Q = 0 
S(t) + P(t)B+AZS(t) = 0 


GA(0- 


, d(® T (t)<P(x(t))) 

•5(f)— ^ — +P{t)B 


dt 


©* T <&(;t(f)) + £(jt(r)) 


= 0 


(48) 

(49) 

(50) 
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subject to the transversality conditions P (tf -* oo) =0 and S (tf —>> 00 ) =0. 

Introducing a time-to-go variable t = tf — f, then the Lyapunov differential equation shown in Eq. (48) becomes 
Eq. (51) 

-^p-+ p (T) A m +Al p (T) + Q = 0 (51) 

subject to the initial conditions in time-to-go P( 0) = 0, for which the existence and uniqueness of its solution is 
well-established. 

It follows that Eq. (49) also has a stable, unique solution in time-to-go since A m is Hurwitz, Eq. (52), 

-^+P(r)B+A T m S(z) = 0 (52) 

a T 

subject to 5(0) = 0. 

The solutions of P( t) and 5 (t) as r °o approach to their steady-state solutions as shown in Eqs. (53) and (54). 

PA m +AlP = -Q (53) 

S = -A~ J PB (54) 

Without any loss of generality, a weighting constant v > 0 £ R is introduced as a gain to allow for adjustments of 
the modification term in the adaptive law, where v = 1 corresponds to an optimal solution. Thus, Eq. (55). 

5 = —vA~ J PB (55) 

The adjoint p is then obtained as shown in Eq. (56). 

p(t) =Pe(t)-vA- T PB& T (t)<S>(x(t)) (56) 

Since 0* is constant, it follows that the adaptive law, shown in Eq. (41), is obtained from Eqs. (46) and (56). 
Theorem: The adaptive law shown in Eq. (41) results in stable and uniformly ultimately bounded tracking error 
e(t ) for all (?(0) ,0(0)) e Wp with an ultimate bound as shown in Eq. (57) 


P = 


Una, (W+Wr-V 2 


where Eqs. (58) and (59) 


£ = 


Inin (P) 

2\\PB\\e 0 


Kin (Q) 
2||g T PA- | g||0Q 
Kin (b t A~ t J QA~' B) 

with £o = sup, ||e(r)|| and ©o = ||©*||. 

Proof: Choose a Lyapunov candidate function as shown in Eq. (60). 


(57) 

(58) 

(59) 


(60) 


V (/) = e T (t)Pe(t) + trace (© T (r)r _l 0(r)) 

Evaluating V yields Eq. (61). 

V(t) = e T (t) (A m P + PA m )e(t) + 2e J ( t)PB [© T (f )) -e(x (f ))] 

-2trace(© T (r)4>(jr(r)) [e T (t)PB-v<t> J (x(t))@(t)B T PA~' 6’j ) (61) 

Using the trace identity trace (A T B) = BA J , V ( t ) can be written as shown in Eq. (62). 

V(t) = -e T (t)Qe(t) + 2e J (t) PB © T (r ) < 1> (jr (f )) - e {x (r))j - 2e J (t) PB@ r (t) 4> (jc ( r )) 

+ 2v«J> t (jc(r))0 ( t)B T PA-'BQ T (?) 4 > (x (t)) (62) 
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The sign-definiteness of the term PA m x is now considered. Recall that a general real matrix G is positive (negative) 
definite if and only if its symmetric part G — \ (G + G t ) is also positive (negative) definite. Then, by pre- and post- 
multiplication of Eq. (42) by A~ T and A" 1 , respectively, PA~ ] can be decomposed into a symmetric part M and 
anti-symmetric part N as shown in Eq. (63) 

PA-'=M + N (63) 


where Eqs. (64) and (65) 


M= 2 ( A " JP + PA »') = ~2 A ™ JQA " 


N-- 


\PAZ'-A- J py 


(64) 

(65) 


Since the symmetric part M < 0, then PA m l < 0. Thus, V ( t ) becomes Eq. (66). 


V ( t ) = -e T (t)Qe(t)- 2e J (t)PBe(x(r)) - v<t> T (x(t))®(t) B T A; n J QA; n 1 BQ J {t)<P(x{t)) 

+ 2v4> t (x(t))& (f ) B J NB@ r (*)<&(*(/)) + 2v<t> J (x ( t))& f B J PA ~ 1 B& J (t)4>(x(t)) (66) 

Letting y(t) = B@ J (f)4>(jr(r)) and using the property y T (t)Ny(t) = 0 for an anti-symmetric matrix N, V ( t ) is 
reduced to Eq. (67) 

V = -e T (t)Qe(t)-2e T (t)PBe (jt(f)) - v<t> T (x{t))&(t)B T A~ T QA-'B® r (t)<P(x(t)) 

+ 2v<t> T (x ( t))®*B J PA~ 1 BQ J (t) <t> (jt (?)) (67) 

which is bounded by Eq. (68). 

?(')<“ ll« (Oil [^min m \\e (OH - 2 ||/»fl|| <9o] 

-v||<l»(^(r))|| 2 ||©(r)|| [A min (B T A- J QA-'B) ||©(r)||-2 


b t pai'b 


©o 


( 68 ) 


For the tracking error e (t) and the weight variation 0 (t) to be bounded, V (t) < 0. Let W be a compact set that 
contains the origin at e (t) = 0 and 0(f) = 0 and such that Eqs. (69) and (70): 


2||P5||e 0 
"" KniniQ) 

2 ||g T fl4~ l g|| ©o 

Amin {B T A- J QA m 'Bj 

It follows that V < 0 for all (c. 0. A) £ S? \ W . where Eqs. (71) and (72): 

y = { (e,©) e r x M mxp x' e@cr} 

JP={(e,®) 61" xR mxp : ||e|| < £ or ||©|| < ic} 

Let Eq. (73) 

« = A mwc (P) C, 2 -\-A max (r _1 ) k 2 
be the maximum value such that W C Wa where Eq. (74): 

W a = { (e , ©) Gl"x R mxp : F < a} 

Let /3 be the minimum value such that Wp c S? where Eq. (75): 

Wp = { (e, ©) € R" x R mxp : V < p } 


(69) 

(70) 

(71) 

(72) 

(73) 

(74) 

(75) 


(70). 


Furthermore, let j5 > a be the maximum value such that W a C ‘Wp, then the set containment is as shown in Eq. 


w c c c y 


(76) 
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Let p = sup, ||<?(0|| be the ultimate bound. Then Eq. (77): 


Kin (P) lk || 2 < Kin (P) \\e\\ 2 + Kin (r" 1 ) ||©|| 2 < V < a (77) 

Hence, Eq. (78). 

p = vtw (78) 

Since V < 0 for all (e,0) and W C C C «^, therefore V is a decreasing function of time outside 

of Wp. Thus, if (e (0) ,0(0)) € then according to Corollary 5. 1 of Ref., the solution will remain inside for 
all t > T > 0. Therefore, e (t) is uniformly ultimately bounded with an ultimate bound p. 

Remark: The effect of the optimal control modification is to add damping to the weight update law so as to reduce 
high-frequency oscillations in the weights. The damping term depends on the persistent excitation condition. With 
persistent excitation, the weight 0 is exponentially stable and bounded. This scheme is contrasted to the well-known 
a- and e- modification methods and other variances which also add damping terms to prevent parameter drift in 
the absence of the persistent excitation. These adaptive laws are compared as follows: 


Modification 

Adaptive Law 

a- 

© (r) = -r [<t> (jc (r ) ) e T (r ) PB + CT© (r )] . a > 0 

£- 

&(t) = ~r\<&(x(t))e T (t)PB + n \\e T (t)PB 0 

Optimal 

© (t ) = -r [4> (;t (0) e T ( t)PB - v<i> T (jc (r ) ) © (r ) B J PA~' B ] , v > 0 


Table 1. Modifications to model-reference adaptive law. 

The lack of robustness was well studied ’ and subsequently motivated the development of the a- modification 
and £-modification. ' In this context, the parameter v plays the same role as the parameters a in the a- modifica- 
tion and ju in the e-modification that provides a trade-off between tracking performance and stability robustness. Thus, 
v has to be selected small enough to provide a desired tracking performance, but large enough to provide sufficient 
robustness against time delay or unmodeled dynamics. 
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Outline 


Full-scale Advanced Systems (FAST) Airframe 


♦ 

♦ 


♦ 


Motivation / Problem statement 

Optimal Control Modification 

• Background 

Stability for Fast Adaptation 

Flight Control Application 



Damaged F-18 results [A] & [B] matrix failures 
Faster Adaptation rate & damping test case results 




Motivation 



♦ Conduct multidisciplinary research to develop integrated capabilities for achieving stability, 
maneuverability, and, safe landing, in the presence of adverse conditions 


Adverse conditions 

Airframe damage such as wing loss 

• Actuator failures 
Aerodynamic upsets 

• Icing 

• Scope of adaptive control research 

Novel adaptive control methods 

• Stability metrics 
Metrics-driven adaptive control 

Integrated flight/propulsion/structure adaptive control 




Problem statement exam| 




► The above were survivable accidents; Can Adaptation help more. 

► Objectives 

° Regain a Stable Platform 

Evaluate Robustness metrics for nonlinear adaptive systems 
° Maneuverability (can you fly it around) 

Control vehicle within new constraints / structural loads etc.. 
° Provide the ability to safely land the airplane 
^ • Develop safest recovery trajectory 




Motivation 


Safe flight under adverse conditions need actions at different time scales 

• Aircraft stabilization - inner loop 
Maneuverability in reduced flight envelope - outer loop 

Safe landing - flight planning and flight management system (FMS) 

♦ Failures/damage can introduce abrupt, large aerodynamic changes that 
require fast adaptation to maintain aircraft stability and performance 

• Fast adaptation implies large adaptive gain in presence of persistent excitation 

♦ Fast adaptation can be problematic 

• High-frequency oscillations 

» Lower tolerance to time delay and unmodeled dynamics 
Actuator dynamic constraints may limit how fast to adapt 

Recent development of LI adaptive control (Cao and Hovakimyan) offers a 
new possibility for robust fast adaptation 

This study explores another possibility for fast adaptation without attendant 
problems associated with conventional MRAC 



Background: Optimal Control 

"Modification 


Given a nonlinear plant 


x(t) = Ax(t) + B[u(t ) + f(x(t))\ 


Assumptionl : The uncertain f(x(t)) can be Darameterized usina a basis function in the following form. 

f = it Wi (*(')) + £ (x(t)) = ®* J< S>{x(t))+£(x(t)) 


i=\ 


© is a unknown weight matrix / 

Q <P is a vector of known bounded basis functions (continuous & differentiable in x) 
e is an approximation error 
The reference model is: 

x(t) = A m x(t) + B m r{t) 

Since r(t) is bounded, xm can be shown to be uniformly bounded such that: 

ll x m(°)ll < £ => llx m (t)|| < 6(e), Vt >0 


The objective is design a feedback adaptive control to enable x(t) to follow x m (r ) 

u(t) = K x x(t) + K r r(t) - u ad (t ) 

Had (0 € R p is a direct adaptive signal which estimates the uncertainty the plant 



Background: Optimal Control Modification 



Assumption2 : There exist Kx & Kr such that the model matching conditions shown below are satisfied. 

Am = A + BKx 
Bm — BK r 

Let 0(f) = 0(r) — 0* be an estimation error of the parametric uncertainty and define the tracking error 

e(t) = xm(t) - x(t) 

e(t) = A m e{t) + B[ 0 T (t)<p(*(t)) - fr(x(t))] 


The adaptive law: 



-nt>(*(r)) 


e T (t)P-v< P (x(t))Q(t)B PA~' 


B 



Comparison of Modification Schemes 


Modification 

Adaptive Law 

(7- 

0(f) = — T [<t>(x(f))e T (t)PB + cr®(t) 

, a > 0 

e- 

0(f) = _r[<l>(;c(f))e T (f)/ > B + jU e J (t)PB 

®(0], ju >0 

Optimal 

0(f) = -r[<t>(x(f))e T (t)PB-v4> J (x(t))Q(t)B J PA-'B], v >0 


Table 1 . Modifications to model-reference adaptive law. 


loannu 


Neural Net Dynamic Inversion Control 



♦ Neural net MRAC command augmentation (Calise) 



Acceleration command feedforward and PI feedback controller for rate retention 
Xd = A m x m + B m r + K p (x m i x)+Ki (x m j x)d^i u ad 


K P = 


diag 


2 . | 2 . | 2 

1 , ■ 2 > ■ 3 


K, 


o 

diag(2» 1 ! 1 ;2» 2 ! 2 ;2» 3 ! 3 ) 


Typical values of ' 1 are 3.5, 2.5, 2.0 rad/s for roll, pitch, yaw; and >>] ^2 

u = B‘ 1 (xj j Ax j Bu j Gz) 

Dl control to cancel out plant dynamics 



Trade-off 


♦ Adaptive control must strive a balance between performance and robustness 


Transient Performance 



Optimal Control 
Modification 



Stability Robustness 


♦ Weighting parameter 0 must be small enough for good transient performance, 
and yet large enough for stability robustness 



F-18 Flight Control Application 

♦ Adverse Failure condition setup: 

♦ All the F-18 simulation results include: 

• 4th order actuators with rate & 
position saturation 

• Time delay (1 frame) 




The following [A] matrix failure will show a destabilizing eg shift. 
• CG Change: 

• M .75 & 20,000 ft. 

• CG shift at 1 3 seconds 

3 longitudinal inputs (one before failure & 2 after) 





[A] matrix failure: CG shift @13 seconds 
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[A] matrix failure: CG shift @13 seconds 
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Time history of tracking errors due to an [A] matrix failure C ma shift at 13 seconds with and w ithout adaptation 



F-18 Flight Control Application 

Adverse Failure conditions [B] matrix 

• Actuator failures [B] 




The following [B] matrix failure will show a control surface jam. 
Change is Left Stabilator jammed at +2.5 degrees from trim. 
• M .75 & 20,000 ft. 

Surface Jammed at 13 seconds 
3 longitudinal inputs (one before failure & 2 after) 




[B] matrix failure: Left stabilator jammed at 2.5 deg @13 seconds 
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failure: Left stabilator jammed at 2.5 deg @13 seconds 
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F-18 Flight Control Application 

♦ Faster adaptation rate change with failure 

♦ Will show better performance with 

♦ Larger weights 




The following shows the same [A] matrix CG shift 
• CG Change: 

• M .75 & 20,000 ft. 

• CG shift at 2 seconds 

• 1 longitudinal inputs 
Adaptation rate change from .5 to 50. 




Adaptation rate change from .5 to 50. with a shift in CG at 2 seconds 






Time, s 


Conclusions 



♦ A new modification method to MRAC based on optimal control approach has 
been developed 

♦ Fast adaptation can be achieved with new adaptive without attendant high- 
frequency oscillations and reduced tolerance to time delay 

♦ Transient performance and robustness can be traded-off by tuning weighting 
constant in adaptive law 

♦ Practical considerations such as actual F-18 actuators rate limits and time delay 
results were shown 

♦ Future task 

♦ Test this design in full nonlinear piloted simulation 



